Provenance Management for Data Quality Assessment

نویسندگان

  • Hua Zheng
  • Qinghua Zhu
  • Kewen Wu
چکیده

The ultimate goal of data quality management (DQM) is to improve the data quality (DQ) to facilitate enterprises decision-making, and the data quality assessment (DQA) is an important aspect in the process of DQM. Existing research in DQA focuses on the establishment of evaluation indicators and quantified methods in specific areas of application, but does not take into account the evolution of the data. For the current context of complex heterogeneous data environment, DQA framework based on provenance and SOA is designed, and provenance management process is focused to describe, finally the provenance model is defined and implemented. Overall, an example described in the paper demonstrates the necessity and feasibility of introducing provenance into DQA.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Representing Interoperable Provenance Descriptions for ETL Workflows

The increasing availability of data on the Web provided by the emergence of Web 2.0 applications and, more recently by Linked Data, brought additional complexity to data management tasks, where the number of available data sources and their associated heterogeneity drastically increases. In this scenario, where data is reused and repurposed on a new scale, the pattern expressed as Extract-Trans...

متن کامل

A QUAL: A Provenance-Aware Quality Model

In this paper we present a model for quality assessment over linked data. This model has been designed to align with emerging standards for provenance on the Web to enable agents to reason about data provenance when performing quality assessment. The model also enables quality assessment provenance to be represented, thus allowing agents to make decisions about re-use of existing assessments. W...

متن کامل

Provenance Information in the Web of Data

The openness of the Web and the ease to combine linked data from different sources creates new challenges. Systems that consume linked data must evaluate quality and trustworthiness of the data. A common approach for data quality assessment is the analysis of provenance information. For this reason, this paper discusses provenance of data on the Web and proposes a suitable provenance model. Whi...

متن کامل

QualityTrails: Data Quality Provenance as a Basis for Sensemaking

Visual Analytics prototypes increasingly support human sensemaking through providing Provenance information. For data analysts the challenge of knowledge generation starts with assessing the quality of a data set, but Provenance is not yet utilized to aid this task. This position paper aims at characterizing the complexity of Visual Analytics methods introducing Provenance in Data Quality by hi...

متن کامل

Using Web Data Provenance for Quality Assessment

The Web of Data cannot be a trustworthy data source unless an approach for evaluating the quality of data on the Web is established and integrated as part of the data publication and access process. In this paper, we propose an approach of using provenance information about the data on the Web to assess their quality and trustworthiness. Our contributions include a model for Web data provenance...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JSW

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2012